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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/982,828 



DATE: 03/05/2002 
TIME: 06:29:48 



INPUT SET. S36790.raw 




General Information: 



This Raw Listing contains the General 

Information Section and up to the first 5 pages, ~^ffiVEQ 

MAR - 5 2002 
TECH CENTER 1600/2900 



SEQUENCE LISTING 



(i) APPLICANT: Murphy, Patricia D. 

Allen, Antonette C. 
Alvares, Christopher P. 
Critz, Brenda S. 
Olson, Sheri J. 
Thurber, Denise 
Zeng, Bin 



ENTERED 



(ii) TITLE OF INVENTION: Coding Sequences of the Human 

BRCAl Gene 

(iii) NUMBER OF SEQUENCES : 72 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Morgan Lewis & Bockius LLP 

(B) STREET: 1111 Pennsylvania Avenue N. W. 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP : 20004 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 

■ (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/982,828 

(B) FILING DATE: 2001-10-22 

(C) CLASSIFICATION: 

(vii) PRIORITY APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/074,453 

(B) FILING DATE: 1998-05-06 

(A) APPLICATION NUMBER: US 08/798,691 

(B) FILING DATE: 1997-02-12 

(A) APPLICATION NUMBER: US 08/598,591 

(B) FILING DATE: 1996-02-12 
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RAW SEQUENCE LISTING DATE: 03/05/2002 

PATENT APPLICATION US/09/982,828 TIME: 06:29:49 

INPUT SET; S36790.raw 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME : Michael S . Tuscan 

(B) REGISTRATION NUMBER: 43,210 

(C) REFERENCE /DOCKET NUMBER: 4492 1- 5053 - 01 -US 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-739-3000 

(B) TELEFAX: 202-739-3001 

m - 5 2002 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Not Relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: BRCA1 (omil) 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 17 

(B) MAP POSITION: 17q21 




HCEW EfU600|2900 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

AGCTCGCTGA GACTTCCTGG ACCCCGCACC AGGCTGTGGG GTTTCTCAGA TAACTGGGCC 60 

CCTGCGCTCA GGAGGCCTTC ACCCTCTGCT CTGGGTAAAG TTCATTGGAA CAGAAAGAAA 12 0 

TGGATTTATC TGCTCTTCGC GTTGAAGAAG TACAAAATGT CATTAATGCT ATGCAGAAAA 180 

TCTTAGAGTG TCCCATCTGT CTGGAGTTGA TCAAGGAACC TGTCTCCACA AAGTGTGACC 24 0 

ACATATTTTG CAAATTTTGC ATGCTGAAAC TTCTCAACCA GAAGAAAGGG CCTTCACAGT 3 00 

GTCCTTTATG TAAGAATGAT ATAACCAAAA GGAGCCTACA AGAAAGTACG AGATTTAGTC 36 0 

AACTTGTTGA AGAGCTATTG AAAATCATTT GTGCTTTTCA GCTTGACACA GGTTTGGAGT 42 0 

ATGCAAACAG CTATAATTTT GCAAAAAAGG AAAATAACTC TCCTGAACAT CTAAAAGATG 48 0 

AAGTTTCTAT CATCCAAAGT ATGGGCTACA GAAACCGTGC CAAAAGACTT CTACAGAGTG 540 
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PATENT APPLICATION US/09/982,828 TIME: 06:29:49 
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AACCCGAAAA 


TCCTTCCTTG 


CAGGAAACCA 


GTCTCAGTGT 


CCAACTCTCT 


INPUT SET: S36790.raw 

AACCTTGGAA 600 


CTGTGAGAAC 


TCTGAGGACA 


AAGCAGCGGA 


TACAACCTCA 


AAAGACGTCT 


GTCTACATTG . 


660 


AATTGGGATC 


TGATTCTTCT 


GAAGATACCG 


TTAATAAGGC 


AACTTATTGC 


AGTGTGGGAG 


720 


ATCAAGAATT 


GTTACAAATC 


ACCCCTCAAG 


GAACCAGGGA 


TGAAATCAGT 


TTGGATTCTG 


780 


CAAAAAAGGC 


TGCTTGTGAA 


TTTTCTGAGA 


CGGATGTAAC 


AAATACTGAA 


CATCATCAAC 


840 


CCAGTAATAA 


TGATTTGAAC 


ACCACTGAGA 


AGCGTGCAGC 


TGAGAGGCAT 


CCAGAAAAGT 


900 


ATCAGGGTAG 


TTCTGTTTCA 


AACTTGCATG 


TGGAGCCATG 


TGGCACAAAT 


ACTCATGCCA 


960 


GCTCATTACA 


GCATGAGAAC 


AGCAGTTTAT 


TACTCACTAA 


AGACAGAATG 


AATGTAGAAA 


1020 


AGGCTGAATT 


CTGTAATAAA 


AGCAAACAGC 


CTGGCTTAGC 


AAGGAGCCAA 


CATAACAGAT 


1080 


118 
119 
120 
121 
122 

1 0"l 
X o 

124 
125 


GGGCTGGAAG 


TAAGGAAACA 


TGTAATGATA 


GGCGGACTCC 


CAGCACAGAA 


AAAAAGGTAG 


1140 


ATCTGAATGC 


TGATCCCCTG 


TGTGAGAGAA 


AAGAATGGAA 


TAAGCAGAAA 


CTGCCATGCT 


1200 


CAGAGAATCC 


TAGAGATACt 


GAAGATGTTC 


CTTGGATAAC 


ACTAAATAGC 


AGCATTCAGA 


1260 


AAGTTAATGA 


GTGGTTTTCC 


AGAAGTGATG 


AACTGTTAGG 


TTCTGATGAC 


TCACATGATG 


1320 


126 
127 
128 
129 
130 
131 
132 
133 


GGGAGTCTGA 


ATCAAATGCC 


AAAGTAGCTG 


ATGTATTGGA 


CGTTCTAAAT 


GAGGTAGATG 


1380 


AATATTCTGG 


TTCTTCAGAG 


AAAATAGACT 


TACTGGCCAG 


TGATCCTCAT 


GAGGCTTTAA 


1440 


TATGTAAAAG 


TGAAAGAGTT 


CACTCCAAAT 


CAGTAGAGAG 


TAATATTGAA 


GACAAAATAT 


1500 


TTGGGAAAAC 


CTATCGGAAG 


AAGGCAAGCC 


TCCCCAACTT 


AAGCCATGTA 


ACTGAAAATC 


1560 


134 
135 
136 
13 7 
138 
13 9 
140 
141 


TAATTATAGG 


AGCATTTGTT 


ACTGAGCCAC 


AGATAATACA 


AGAGCGTCCC 


CTCACAAATA 


1620 


AATTAAAGCG 


TAAAAGGAGA 


CCTACATCAG 


GCCTTCATCC 


TGAGGATTTT 


ATCAAGAAAG 


1680 


CAGATTTGGC 


AGTTCAAAAG 


ACTCCTGAAA 


TGATAAATCA 


GGGAACTAAC 


CAAACGGAGC 


1740 


AGAATGGT C A 


AGTGATGAAT 


ATTACTAATA 


GTGGTCATGA 


GAATAAAACA 


AAAGGTGATT 


1800 


142 
143 
144 
145 
146 
147 


CTATTCAGAA 


TGAGAAAAAT 


CCTAACCCAA 


TAGAATCACT 


CGAAAAAGAA 


TCTGCTTTCA 


1860 


AAACGAAAGC 


TGAAC CTATA 


AGCAGCAGTA 


TAAGCAATAT 


GGAACTCGAA 


TTAAATATCC 


1920 


ACAATTCAAA 


AGCACCTAAA 


AAGAATAGGC 


TGAGGAGGAA 


GTCTTCTACC 


AGGCATATTC 


1980 


148 
149 


ATGCGCTTGA 


ACTAGTAGTC 


AGTAGAAATC 


TAAGCCCACC 


TAATTGTACT 


GAATTGCAAA 


2040 


150 
151 
152 


TTGATAGTTG 


TTCTAGCAGT 


GAAGAGATAA 


AGAAAAAAAA 


GTACAAC CAA 


ATGCCAGTCA 


2100 


GGCACAGCAG 


AAACCTACAA 


CTCATGGAAG 


GTAAAGAACC 


TGCAACTGGA 


GCCAAGAAGA 


2160 
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153 










INPUT SET: S36790.raw 
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160 
161 
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167 
168 
169 
170 
171 
172 
173 
174 
175 
176 

i IT 

± 1 / 

178 
179 
180 
181 
182 
183 
184 
185 
186 

1 Q H 


GTAACAAGCC 
AGTTAACAAA 


AAATGAACAG 
TGCACCTGGT 


ACAAGTAAAA 
TCTTTTACTA 


GACATGACAG 
AGTGTTCAAA 


TGATACTTTC 
TACCAGTGAA 


CCAGAGCTGA 
CTTAAAGAAT 


2220 
2280 


TTGTCAATCC 


TAGCCTTCCA 


AGAGAAGAAA 


AAGAAGAGAA 


ACTAGAAACA 


GTTAAAGTGT 


2340 


CTAATAATGC 


TGAAGACCCC 


AAAGATCTCA 


TGTTAAGTGG 


AGAAAGGGTT 


TTGCAAACTG 


2400 


AAAGATCTGT 


AGAGAGTAGC 


AGTATTTCAC 


TGGTACCTGG 


TACTGATTAT 


GGCACTCAGG 


2460 


AAAGTATCTC 


GTTACTGGAA 


GTTAGCACTC 


TAGGGAAGGC 


AAAAACAGAA 


CCAAATAAAT 


2520 


GTGTGAGTCA 


GTGTGCAGCA 


TTTGAAAACC 


CCAAGGGACT 


AATTCATGGT 


TGTTCCAAAG 


2580 


ATAATAGAAA 


TGACACAGAA 


GGCTTTAAGT 


ATCCATTGGG 


ACATGAAGTT 


AACCACAGTC 


2640 


GGGAAACAAG 


CATAGAAATG 


GAAGAAAGTG 


AACTTGATGC 


TCAGTATTTG 


CAGAATACAT 


2700 


TCAAGGTTTC 


AAAGCGCCAG 


TCATTTGCTC 


TGTTTTCAAA 


TCCAGGAAAT 


GCAGAAGAGG 


2760 


AATGTGCAAC 


ATTCTCTGCC 


CACTCTGGGT 


CCTTAAAGAA 


ACAAAGTCCA 


AAAGTCACTT 


2820 


TTGAATGTGA 


ACAAAAGGAA 


GAAAATCAAG 


GAAAGAATGA 


GTCTAATATC 


AAGCCTGTAC 


2880 


AGACAGTTAA 


TATCACTGCA 


GGCTTTCCTG 


TGGTTGGTCA 


GAAAGATAAG 


CCAGTTGATA 


2940 


ATGCCAAATG 


TAGTATCAAA 


GGAGGCTCTA 


GGTTTTGTCT 


ATCATCTCAG 


TTCAGAGGCA 


3000 


ACGAAACTGG 


ACTCATTACT 


CCAAATAAAC 


ATGGACTTTT 


ACAAAACCCA 


TATCGTATAC 


3060 


CACCACTTTT 


TCCCATCAAG 


TCATTTGTTA 


AAACTAAATG 


TAAGAAAAAT 


CTGCTAGAGG 


3120 


AAAACTTTGA 


GGAACATTCA 


ATGTCACCTG 


AAAGAGAAAT 


GGGAAATGAG 


AACATTCCAA 


3180 


188 
189 
190 

1 Q1 

192 
193 
194 
195 
196 
197 


GTACAGTGAG 


CACAATTAGC 


CGTAATAACA 


TTAGAGAAAA 


TGTTTTTAAA 


GGAGCCAGCT 


3240 


CAAGCAATAT 


TAATGAAGTA 


GGTTCCAGTA 


CTAATGAAGT 


GGGCTCCAGT 


ATTAATGAAA 


3300 


TAGGTTCCAG 


TGATGAAAAC 


ATT C AAGC AG 


AACTAGGTAG 


AAACAGAGGG 


CCAAAATTGA 


3360 


ATGCTATGCT 


TAGATTAGGG 


GTTTTGCAAC 


CTGAGGTCTA 


TAAACAAAGT 


CTTCCTGGAA 


3420 


GTAATTGTAA 


GCATCCTGAA 


ATAAAAAAGC 


AAGAATATGA 


AGAAGTAGTT 


CAGACTGTTA 


3480 


198 
199 
200 
201 


ATACAGATTT 


CTCTCCATAT 


CTGATTTCAG 


ATAACTTAGA 


ACAGCCTATG 


GGAAGTAGTC 


3540 


ATGCATCTCA 


GGTTTGTTCT 


GAGACACCTG 


ATGACCTGTT 


AGATGATGGT 


GAAATAAAGG 


3600 


202 
203 
204 


AAGATACTAG 


TTTTGCTGAA 


AATGACATTA 


AGGAAAGTTC 


TGCTGTTTTT 


AGCAAAAGCG 


3660 


TCCAGAGAGG 


AGAGCTTAGC 


AGGAGTCCTA 


GCCCTTTCAC 


C CAT AC AC AT 


TTGGCTCAGG 


3720 
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237 
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248 
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250 
251 
252 
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254 
255 
256 
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258 


GTTACCGAAG 


AGGGGPPAAG 

-P1VJ VJ VJVJ V_ V,, n£\\J 


A A ATT AG APT 


PPTPAPA APA 


P A APTTATPT 
VJ.H_H.\_, 1 1A1L1 


APTP APP ATP 
Avj 1 vjAvjvj A 1 VJ 


■.TOO 

j / ou 


AAGAGCTTCP 


PTGPTTPPAA 


p A p TTfiTT 1 A T 
UnLl X Vj X X J-\ X 


TTnPTA A Af^lT 
X X X nntvj X 


A A A P A A T A T* A 
AAALAA1A1A 


PPTTPTP APT 1 

bL 1 lbl bAb 1 


3 840 


PTAPTAGGPA 


TAGPAPPGTT 


PPTAPPPAPT 
vj\_ X.ribv,vj.H.b X 


fZTPTPTPTA A 


paapapapap 
vjA>\v_.Av_-/\L7/\vj 


P A P A A r P r P r n A T 1 

bAbAAl 1 1A1 


j y uu 


TATCATTGAA 


GAATAGPTTA 


AATGAPTGPA 


GTAAPPAGGT 


A AT ATTPPP A 


A APPP ATPTP 
AAbbLA 1 b 1 b 




AGGAAGATGA 


PPTTAGTGAG 

v_ \_ x X rWj X \Jr\ vj 


PA A APA A A AT 


PTTPTPPTAP 


PTTP TT'PTPT 1 
LI lbl I I 1LI 


TP A PAP TP P A 

1 bAbAb 1 bbA 


4Uz U 


GTGAATTGGA 

vj x vj,e-w"i. x x vjvjfrl 


AGAPTTGAPT 

-Mvj-riv_ X X v-T.M.V__ x 


PP 21 A A T A P 2i 21 
vjv_.HJ-J_rt. 1 AL/in 


AP APPP APP A 


rp rp rp rn /~i rn rp /""i 

ICC 11 ltllb 


All bb 1 1 b 1 1 


408 0 


PPAAAPAAAT 


P APPP ATP AP 


tptpa zi ap.pp 


APPP APTTPP 


1L1 vj Avj 1 bAL 


A A PP A A i'|ii|ipp 

AAbbAAI Ibvj 


4 140 


TTTPAGATPA 


TGAAGAAAGA 

X Vj._r-J^vj.£->_rt^-xvj-f"-. 


PP A A PPPPPT 

VJVJ.rtJ-lV-.VJbVjb X 


TPP A A P A A A A 


TA ATP A APA A 
1 rt-rt, 1 LAAbAA 


PAPPAAAPPA 

bAbLAAAbLA 


zi 0 n n 


TGGATTPAAA 

X UUfl X x V^r-XrtJTl 


PTTAPPTPA A 


PP A PP 21 TPTP 
bb.H.bb,fvl b 1 vj 


PPTPTP AP AP 


rpp A A APA 

1 vjAAAL-AAvjC 


b 1 C_ 1 b TbAAG 


426 0 






b -fiG/ib 1 b.rtb/\ 


111 IAALLAL 


TIPAPPAPAPP 


P 7\ m TV P /"I TV my*"* /"I 

GATACCATGC 


4320 


A AP AT A A PPT 


pa tiv a arrTP 

brt. 1 rt-ri-T-ivjb X V_ 


pappappaaa 

b AG b AG bAAA 


1 vjvjC, 1 CjAAL 1 


A P A A Pprppi"PP 

ACjAACjC 1 (j TG 


mm TV TV TV /"I TV /"I /~1 

T T AG AAC AGC 


43 80 


ATGPGAPPPA 

-ri X \J\J\J,rt.VjV»».V — .,rt. 


PPPTTPTA AP 
VjV_v_X XbX-rinb 


21 PPT 21 PPPTT 
-H.bb x .rtbbb X X 


PPATP ATA AP 


1 bAL 1 L-C 1 C_ 1 


PPPPrnTP 7\ PP 

G b b CTTGAGG 


444 0 


APPTGPGAAA 


TPPAPA APA A 


zi p p 21 p 21 tp a p 

AbC/ibA 1 bAb 


AAAAAPPAPT 
AAA/iif4vjCjrt.vj 1 


Al 1 AAv_ 1 1CA 


PAPA TV TV APrpTV 

G AG AAAAGT A 


4500 


PTPA ATAPPP 
vj X O-rt-rl x i-iv— L, L. 


tata anrrar 


AATPPAPAAP 
AH. X L, b AGAAb 


VJV..L, 1 1 1 L, 1 (jL, 


qip A P A A P n»| 11 r 1 

IvjAL-AACjI 1 1 


P A PPTiprppmP 

G AGGT G T C TG 


4560 


PAGATAGTTP 

Virion x £^\J X X L- 


TAPP APT A Z\ 


AATAAAPAAP 
AA 1 AAAbAAb 


PAPP APTPP A 
v_ AvjvjAvj 1 vjvjA 


A 7\ PPT'PA T'PP 

AAvjCj 1 L.A 1 


pprprppiT»A TV TV rp 

GG1 lblAAAT 


4620 


GPPP ATP ATT 

V V_ X V_jT-i. X X 


APATPATAPP 


TP.PTAP ATPP 
X bb 1 Hb-H. x bb 


A P A P TTP PTP 
Av-Avj 1 1 LtL, 1 v_ 


1 vjvjvjACj 1 L. 1 1 


PAPA ATiAPA A 

GAGAATAGAA 


4680 


APTAPPPATP 


TP A APAPPAP 
X v_,.tt^vj,rt.vjvj-f-i.b 


PTP ATT A A PP 
b 1 C A 1 1 AAbb 


1 lbl 1 VJ A 1 VJ 1 


PPAPPAPPA A 
vjCj AvjvjAvj C AA 


PAPPrpPPTv TvP 

GAGGTGGAAG 


4740 


APTPTPPPPP 

-rivj X V_ X VJVJVJV— k_ 


-r_.b-riV_,b.H.l 1 io 


APPP A A A PAT 
AL, bb AAAb A x 


v_ 1 1AL 1 1 bLL 


a apppa a p a t* 
AAvjCjC AACjA 1 


PT"l A P A PPP TV TV 

G TAGAGGGAA 


480 0 


PPPPTTAPPT 


PP A A TPTPP A 
vjvj,tt-H. 1 1 \J\Jr\ 


ATP APPPTPT 
A 1 bAbbb 1L1 


lLiLl Vj/\ 1 VjA 


PPPTP A A TPT 

v, L. L. 1 tj AA 1 L 1 


P A mPPTTPiPP 

GA1 GG 1 1 GTG 


4860 


A AG AP AG AGP 


PPPAPAPTP A 
v^v^L-.tt.vj.rt.vj X b.H. 


PPTPPTPTTP 


vj LAALA 1 AL L, 


AT'PT'T'PA APP 

A1L1 1 LaALL 


rnprpp P A rprpp TV 

T C TGC AT TG A 


4 920 


A AGTTPPPPA 


ATTPA A APTT 


PP AP A ATPTP 


L-CL-Avjvjvj 1 L.L. 


ACjC 1 bL 1 CjCI 


P TV rp7\ prpTv prriP 

GAT AC TAG TG 


498 0 


ATACTGCTGG 


GTATAATGCA 


ATGGAAGAAA 


GTGTGAGCAG 


GGAGAAGC C A 


GAATTGACAG 


5040 


CTTCAACAGA 


AAGGGTCAAC 


AAAAGAATGT 


CCATGGTGGT 


GTCTGGCCTG 


ACCCCAGAAG 


5100 


AATTTATGCT 


CGTGTACAAG 


TTTGCCAGAA 


AACACCACAT 


CACTTTAACT 


AATCTAATTA 


5160 


CTGAAGAGAC 


TACTCATGTT 


GTTATGAAAA 


CAGATGCTGA 


GTTTGTGTGT 


GAACGGACAC 


5220 


TGAAATATTT 


TCTAGGAATT 


GCGGGAGGAA 


AATGGGTAGT 


TAGCTATTTC 


TGGGTGACCC 


5280 


AGTCTATTAA 


AGAAAGAAAA 


ATGCTGAATG 


AGCATGATTT 


TGAAGTCAGA 


GGAGATGTGG 


5340 



SEQUENCE VERIFICATION REPORT 
PATENT APPLICATION US/09/982,828 



DATE: 03/05/2002 
TIME: 06:29:50 



INPUT SET: S36790.raw 

Original Text 



